Abstract
1 Introduction


Singularly perturbed systems have a wide range of applications in science and engineering
(e.g., van Kampen [27]) and have been investigated extensively in recent years; see, for
instance, Freidlin-Wentzell [11] and Yin-Zhang [29]. Such systems usually exhibit
multi-scale behavior owing to inherent rates of change of the systems or to different rates
of interaction of subsystems and components. To reflect the slow and fast motions in
the underlying systems, a time-scale separation parameter $\varepsilon \in (0,1)$ is often introduced.
Due to the multi-scale property, it is frequently difficult to deal with such systems using
a direct approach. As a result, it is of foremost importance to reduce their complexity. The
averaging principle pioneered by Khasminskii [18] for a class of diffusions provides an effective
way to do so. For systems in which fast and slow components co-exist, the averaging approach
reveals that there is a limit dynamic system, given by the average of the slow component with
respect to the invariant measure of the fast component, which is an ergodic process. The
averaged equation approximates the slow component in a suitable sense as $\varepsilon \downarrow 0$, leading to a
substantial reduction of computational complexity. The work [18] by Khasminskii inspired much
of the subsequent development. To date, there is a vast literature on multi-scale
stochastic dynamic systems (see, e.g., the monograph [IS]). For strong/weak convergence in the
averaging principle, we refer to, e.g., Givon et al. [13], Liu [22], Liu-Yin [23], and Yin-Zhang
[29] for stochastic differential equations (SDEs), and Blomker et al. [1], Brehier [5], Cerrai
[6], Fu et al. [12], and Kuksin-Piatnitski [19] for stochastic partial differential equations
(SPDEs). With regard to numerical methods, we refer to, e.g., E et al. [10] and Givon
et al. [14]. As for related control and filtering problems, we mention, e.g., Kushner [20].
Concerning large deviations, we refer to, e.g., Kushner [20] and Veretennikov [28].

The aforementioned references are all concerned with systems without "memory". Nevertheless,
more often than not, dynamic systems with delay are unavoidable in a wide variety
of applications in science and engineering, where the dynamics are subject to propagation
delays. In response to this need, there is also an extensive literature on functional
SDEs; see, e.g., the monographs [21, 25].

In contrast to the rapid progress in two-time-scale systems and differential delay equations,
the study of averaging principles for functional SDEs is still in its infancy. Compared
with the existing literature, one of the outstanding issues for such systems is that the phase
space of the segment processes is infinite-dimensional, which makes the goal of obtaining a strong
limit theorem for the averaging principle a very difficult task. This work aims to take up this
challenge and to establish a strong limit theorem for the averaging principle for a range of
two-time-scale functional SDEs.

The rest of the paper is organized as follows. Section 2 presents the setup of the problem
we wish to study. The ergodicity of the frozen equation with memory is obtained in Section
3. Section 4 constructs some auxiliary two-time-scale stochastic systems with memory and
provides a number of preliminary lemmas. Section 5 derives a strong limit theorem for the
averaging principle in the spirit of Khasminskii's approach for the slow component.

Before proceeding further, a word of notation is in order. Throughout the paper, generic
constants will be denoted by $c$; we use the shorthand notation $a \lesssim b$ to mean $a \le cb$, and we use
$a \lesssim_T b$ to emphasize that the constant $c$ depends on $T$.


2 Formulation 

For integers $n, m \ge 1$, let $(\mathbb{R}^n, |\cdot|, \langle\cdot,\cdot\rangle)$ be an $n$-dimensional Euclidean space, and let
$\mathbb{R}^n \otimes \mathbb{R}^m$ denote the collection of all $n \times m$ matrices with real entries. For an
$A \in \mathbb{R}^n \otimes \mathbb{R}^m$, $\|A\|$ stands for its Frobenius matrix norm. For an interval
$I \subset (-\infty,\infty)$, $C(I;\mathbb{R}^n)$ means the family of all continuous functions from $I$ to $\mathbb{R}^n$.
For a fixed $\tau > 0$, let $\mathscr{C} = C([-\tau,0];\mathbb{R}^n)$, endowed with the uniform norm $\|\cdot\|_\infty$.
For $h(\cdot) \in C([-\tau,\infty);\mathbb{R}^n)$ and $t \ge 0$, define the segment $h_t \in \mathscr{C}$ by
$h_t(\theta) = h(t+\theta)$, $\theta \in [-\tau,0]$.
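The segment map can be made concrete with a few lines of code. The following is a minimal sketch for the scalar case $n = 1$; the function names `segment` and `sup_norm` are illustrative only, and the uniform norm is approximated on a finite grid rather than computed exactly.

```python
from typing import Callable


def segment(h: Callable[[float], float], t: float, tau: float) -> Callable[[float], float]:
    """Return the segment h_t, i.e. the map theta -> h(t + theta) on [-tau, 0]."""
    def h_t(theta: float) -> float:
        if not (-tau <= theta <= 0.0):
            raise ValueError("theta must lie in [-tau, 0]")
        return h(t + theta)
    return h_t


def sup_norm(f: Callable[[float], float], tau: float, grid_points: int = 1001) -> float:
    """Approximate the uniform norm of f on [-tau, 0] by a maximum over a grid."""
    step = tau / (grid_points - 1)
    # Clamp to 0.0 so that rounding never pushes the last grid point above 0.
    return max(abs(f(min(0.0, -tau + i * step))) for i in range(grid_points))


# Example: h(s) = s^2 with t = 2 and tau = 1 gives h_t(theta) = (2 + theta)^2.
h_t = segment(lambda s: s * s, t=2.0, tau=1.0)
```

Here `h_t(-1.0)` evaluates $h(1)$ and `h_t(0.0)` evaluates $h(2)$, so the segment at time $t$ records the path over the window $[t-\tau, t]$.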

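Introducing a time-scale separation parameter $\varepsilon \in (0,1)$, we consider two-time-scale
systems of functional stochastic differential equations (SDEs) of the following form

(2.1)  $dX^\varepsilon(t) = b_1(X^\varepsilon_t, Y^\varepsilon_t)\,dt + \sigma_1(X^\varepsilon_t)\,dW_1(t)$, $t > 0$, $X^\varepsilon_0 = \xi \in \mathscr{C}$,

and

(2.2)  $dY^\varepsilon(t) = \frac{1}{\varepsilon} b_2(X^\varepsilon_t, Y^\varepsilon(t), Y^\varepsilon(t-\tau))\,dt + \frac{1}{\sqrt{\varepsilon}} \sigma_2(X^\varepsilon_t, Y^\varepsilon(t), Y^\varepsilon(t-\tau))\,dW_2(t)$, $t > 0$,

with the initial value $Y^\varepsilon_0 = \eta \in \mathscr{C}$, where $b_1 : \mathscr{C} \times \mathscr{C} \to \mathbb{R}^n$,
$b_2 : \mathscr{C} \times \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n$, $\sigma_1 : \mathscr{C} \to \mathbb{R}^n \otimes \mathbb{R}^m$, and
$\sigma_2 : \mathscr{C} \times \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n \otimes \mathbb{R}^m$ are Gateaux differentiable, and $(W_1(t))_{t \ge 0}$
and $(W_2(t))_{t \ge 0}$ are two mutually independent $m$-dimensional Brownian motions defined on
a probability space $(\Omega, \mathscr{F}, P)$, equipped with a family of filtrations $(\mathscr{F}_t)_{t \ge 0}$ satisfying the
usual conditions (i.e., for each $t \ge 0$, $\mathscr{F}_t = \mathscr{F}_{t+} := \bigcap_{s > t} \mathscr{F}_s$, and $\mathscr{F}_0$ contains all $P$-null sets).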
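To illustrate the structure of (2.1) and (2.2), the following is a minimal Euler-Maruyama sketch for a scalar toy system. The coefficients $b_1 = -X(t) + \sin Y(t)$, $\sigma_1 = 0.2$, $b_2 = -2Y(t) + 0.5\,Y(t-\tau) + X(t)$, $\sigma_2 = 0.5$, the constant initial segments, and all numerical parameters are assumptions chosen for illustration; they are not taken from the paper. The step size is kept well below $\varepsilon$ so that the fast equation remains numerically stable.

```python
import math
import random


def simulate_two_time_scale(eps=0.05, tau=0.1, T=1.0, dt=1e-3, seed=0):
    """Euler-Maruyama paths (x, y) of a scalar slow-fast system with point delay tau."""
    rng = random.Random(seed)
    lag = round(tau / dt)          # number of grid steps spanned by the delay
    n_steps = round(T / dt)
    x = [1.0] * (lag + 1)          # initial segment xi = 1 on [-tau, 0] (assumed)
    y = [0.0] * (lag + 1)          # initial segment eta = 0 on [-tau, 0] (assumed)
    for _ in range(n_steps):
        dw1 = rng.gauss(0.0, math.sqrt(dt))
        dw2 = rng.gauss(0.0, math.sqrt(dt))
        x_now, y_now, y_delayed = x[-1], y[-1], y[-1 - lag]
        # Slow component: O(1) drift and diffusion.
        x_next = x_now + (-x_now + math.sin(y_now)) * dt + 0.2 * dw1
        # Fast component: drift scaled by 1/eps, diffusion scaled by 1/sqrt(eps).
        drift = -2.0 * y_now + 0.5 * y_delayed + x_now
        y_next = y_now + drift * dt / eps + 0.5 * dw2 / math.sqrt(eps)
        x.append(x_next)
        y.append(y_next)
    return x, y


xs, ys = simulate_two_time_scale()
```

The returned lists include the initial segments, so the entry at index `lag` corresponds to time $0$. Decreasing `eps` makes the second component fluctuate on an ever shorter time scale, which is the regime in which an averaged description of the first component becomes useful.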

As usual, for the two-time-scale systems (2.1) and (2.2), $X^\varepsilon(t)$ is called the slow component,
while $Y^\varepsilon(t)$ is called the fast component.
We denote by $\nabla^{(i)}$ the gradient operator for the $i$-th component. Throughout the paper,
for any $\chi, \phi \in \mathscr{C}$ and $x, x', y, y' \in \mathbb{R}^n$, we assume that

(A1) $\nabla b_1 = (\nabla^{(1)} b_1, \nabla^{(2)} b_1)$ is bounded, and there exists an $L > 0$ such that

    $|b_1(\chi, \phi)| \le L(1 + \|\chi\|_\infty)$ and $\|\sigma_1(\phi) - \sigma_1(\chi)\| \le L \|\phi - \chi\|_\infty$.

(A2) $\nabla b_2 = (\nabla^{(1)} b_2, \nabla^{(2)} b_2, \nabla^{(3)} b_2)$ and $\nabla \sigma_2 = (\nabla^{(1)} \sigma_2, \nabla^{(2)} \sigma_2, \nabla^{(3)} \sigma_2)$ are bounded.

(A3) There exist $\lambda_1 > \lambda_2 > 0$, independent of $\chi$, such that

    $2\langle x - x', b_2(\chi,x,y) - b_2(\chi,x',y')\rangle + \|\sigma_2(\chi,x,y) - \sigma_2(\chi,x',y')\|^2$
        $\le -\lambda_1 |x - x'|^2 + \lambda_2 |y - y'|^2$.

(A4) For the initial value $X^\varepsilon_0 = \xi \in \mathscr{C}$ of (2.1), there exists a $\lambda_3 > 0$ such that

    $|\xi(t) - \xi(s)| \le \lambda_3 |t - s|$, $s, t \in [-\tau, 0]$.

Let us comment on the assumptions (A1)-(A4) above. From (A1) and (A2), the gradient
operators $\nabla b_1$, $\nabla b_2$, and $\nabla \sigma_2$ are bounded, so that $b_1$, $b_2$, and $\sigma_2$ are Lipschitz.
Then both (2.1) and (2.2) are well posed (see, e.g., [21, Theorem 2.2, P.150]). Meanwhile,
(A3) is imposed to analyze the ergodic property of the frozen equation (see Theorem 3.1
below), to guarantee the Lipschitz property of $\bar b_1$ defined in (3.3) (see Corollary 3.2 below), and
to provide a uniform bound of the segment process $(Y^\varepsilon_t)_{t \in [0,T]}$ (see Lemma 4.3 below). Next,
(A4) ensures that the displacement of the segment process $(X^\varepsilon_t)_{t \in [0,T]}$ is continuous in the
mean $L^p$-norm sense (see Lemma 4.1 below).
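As a sanity check on the dissipativity condition (A3), consider the scalar linear drift $b_2(\chi, x, y) = -a x + b y$ with constant $\sigma_2$; this example is an assumption made here for illustration and is not taken from the paper. Since $2 b\,|x-x'|\,|y-y'| \le b|x-x'|^2 + b|y-y'|^2$, condition (A3) holds with $\lambda_1 = 2a - b$ and $\lambda_2 = b$, and $\lambda_1 > \lambda_2$ exactly when $a > b > 0$. The sketch below evaluates the slack in the inequality at random points.

```python
import random


def a3_slack(a: float, b: float, x: float, xp: float, y: float, yp: float) -> float:
    """Right-hand side minus left-hand side of (A3) for b2 = -a*x + b*y, constant sigma2."""
    dx, dy = x - xp, y - yp
    # Left-hand side: 2 <x - x', b2(x, y) - b2(x', y')>; the sigma2 term vanishes.
    lhs = 2.0 * dx * ((-a * x + b * y) - (-a * xp + b * yp))
    lam1, lam2 = 2.0 * a - b, b
    rhs = -lam1 * dx * dx + lam2 * dy * dy
    return rhs - lhs   # algebraically equal to b * (dx - dy)**2, hence nonnegative


rng = random.Random(0)
samples = [[rng.uniform(-10.0, 10.0) for _ in range(4)] for _ in range(1000)]
worst_slack = min(a3_slack(2.0, 0.5, *s) for s in samples)
```

With $a = 2$ and $b = 0.5$ one gets $\lambda_1 = 3.5 > \lambda_2 = 0.5$, and `worst_slack` stays nonnegative up to floating-point rounding.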

3 Ergodicity of the Frozen Equation with Memory 

Consider an SDE with memory associated with the fast motion, with the slow component
frozen, in the form

(3.1)  $dY(t) = b_2(\zeta, Y(t), Y(t-\tau))\,dt + \sigma_2(\zeta, Y(t), Y(t-\tau))\,dW_2(t)$, $t > 0$, $Y_0 = \eta \in \mathscr{C}$.

Under (A2), (3.1) has a unique strong solution $(Y(t))_{t \ge -\tau}$ (see, e.g., [21, Theorem 2.2,
P.150]). To highlight the initial value $\eta \in \mathscr{C}$ and the frozen segment $\zeta \in \mathscr{C}$, we write the
corresponding solution process $(Y^\zeta(t,\eta))_{t \ge -\tau}$ and the segment process $(Y^\zeta_t(\eta))_{t \ge 0}$ instead of
$(Y(t))_{t \ge -\tau}$ and $(Y_t)_{t \ge 0}$, respectively.

Our main result in this section is stated below. It is concerned with ergodicity of the
frozen SDE with memory.
Theorem 3.1. Under (A2) and (A3), $Y^\zeta_t(\eta)$ has a unique invariant measure $\pi^\zeta$, and there
exists $\lambda > 0$ such that

(3.2)  $|E\, b_1(\zeta, Y^\zeta_t(\eta)) - \bar b_1(\zeta)| \lesssim e^{-\lambda t}(1 + \|\eta\|_\infty + \|\zeta\|_\infty)$, $t \ge 0$,

where

(3.3)  $\bar b_1(\zeta) := \int_{\mathscr{C}} b_1(\zeta, \psi)\, \pi^\zeta(d\psi)$.
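The averaged drift in (3.3) can be approximated numerically by a long-time average along one path of the frozen equation, which is what ergodicity suggests. The sketch below does this for an assumed scalar example, not taken from the paper: the frozen equation $dY = (-aY(t) + bY(t-\tau) + \zeta_0)\,dt + \sigma\,dW_2$ with $b_1(\zeta,\psi) = \psi(0)$, where $\zeta_0$ stands for $\zeta(0)$. Taking expectations in the stationary regime gives the mean $\zeta_0/(a-b)$, which serves as a reference value.

```python
import math
import random


def estimate_b1_bar(zeta0, a=2.0, b=0.5, sigma=0.5, tau=0.1,
                    dt=0.01, horizon=200.0, burn_in=10.0, seed=1):
    """Time-average estimate of the stationary mean of Y(t) for a frozen linear delay SDE."""
    rng = random.Random(seed)
    lag = round(tau / dt)
    y = [0.0] * (lag + 1)              # initial segment eta = 0 (assumed)
    n_steps = round(horizon / dt)
    n_burn = round(burn_in / dt)       # discard the transient before averaging
    total, count = 0.0, 0
    for i in range(n_steps):
        y_now, y_delayed = y[-1], y[-1 - lag]
        drift = -a * y_now + b * y_delayed + zeta0
        y_next = y_now + drift * dt + sigma * rng.gauss(0.0, math.sqrt(dt))
        y.append(y_next)
        if i >= n_burn:
            total += y_next
            count += 1
    return total / count


estimate = estimate_b1_bar(zeta0=1.5)   # reference value: 1.5 / (2.0 - 0.5) = 1.0
```

The estimate carries Monte Carlo and discretization error, so it should only be compared with the reference value up to a tolerance; the exponential rate in (3.2) is what makes a moderate burn-in period sufficient in such experiments.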

Proof. The main idea of the proof concerning existence of an invariant measure goes back
to [U Lemma 2.4], which, nevertheless, involves functional SDEs with additive noises.

Let $\mathscr{P}(\mathscr{C})$ be the set of all probability measures on $\mathscr{C}$. $W_2$ denotes the $L^2$-Wasserstein
distance on $\mathscr{P}(\mathscr{C})$ induced by the bounded distance $\rho(\xi,\eta) := 1 \wedge \|\xi - \eta\|_\infty$, i.e.,

    $W_2(\mu_1, \mu_2) = \inf_{\pi \in \mathcal{C}(\mu_1,\mu_2)} (\pi(\rho^2))^{1/2}$, $\mu_1, \mu_2 \in \mathscr{P}(\mathscr{C})$,

where $\mathcal{C}(\mu_1,\mu_2)$ is the set of all coupling probability measures with marginals $\mu_1$ and $\mu_2$. It
is well known that $\mathscr{P}(\mathscr{C})$ is a complete metric space w.r.t. the distance $W_2$ ([7, Lemma 5.3,
P.174] and [7, Theorem 5.4, P.175]), and the convergence in $W_2$ is equivalent to the weak
convergence ([7, Theorem 5.6, P.179]). Let $P_t^{\eta,\zeta}$ be the law of the segment process $Y^\zeta_t(\eta)$.
According to the Krylov-Bogoliubov existence theorem ([9, Theorem 3.1.1, P.21]), if $P_t^{\eta,\zeta}$
converges weakly to a probability measure $\pi^\zeta_\eta$, then $\pi^\zeta_\eta$ is an invariant measure. So, in light
of the previous discussion, it suffices to prove the assertions below:

(i) $(P_t^{\eta,\zeta})_{t \ge 0}$ is a Cauchy sequence w.r.t. the distance $W_2$. If so, by the completeness of
    $\mathscr{P}(\mathscr{C})$ w.r.t. the distance $W_2$, there is $\pi^\zeta_\eta \in \mathscr{P}(\mathscr{C})$ such that $\lim_{t \to \infty} W_2(P_t^{\eta,\zeta}, \pi^\zeta_\eta) = 0$;

(ii) $W_2(\pi^\zeta_\eta, \pi^\zeta_{\eta'}) = 0$ for any $\eta, \eta' \in \mathscr{C}$ and frozen $\zeta \in \mathscr{C}$, that is, $\pi^\zeta_\eta$ is independent of $\eta$.

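In the sequel, we shall show that (i) and (ii) hold, respectively. For any $t_2 > t_1 > \tau$ and the
frozen segment $\zeta \in \mathscr{C}$, consider the following SDE with memory

(3.4)  $d\bar Y(t) = b_2(\zeta, \bar Y(t), \bar Y(t-\tau))\,dt + \sigma_2(\zeta, \bar Y(t), \bar Y(t-\tau))\,dW_2(t)$, $t \in [t_2 - t_1, t_2]$,

with the initial value $\bar Y_{t_2 - t_1} = \eta$. The solution process and the segment process associated
with (3.4) are denoted by $(\bar Y^\zeta(t,\eta))$ and $(\bar Y^\zeta_t(\eta))$, respectively. Observe that the laws of
$Y^\zeta_{t_2}(\eta)$ and $\bar Y^\zeta_{t_2}(\eta)$ are $P_{t_2}^{\eta,\zeta}$ and $P_{t_1}^{\eta,\zeta}$, respectively.
By (A2), there exists an $\alpha > 0$ such that

(3.5)  $\|\sigma_2(\chi,x,y) - \sigma_2(\chi,x',y')\| \le \alpha(|x - x'| + |y - y'|)$,

and

(3.6)  $|b_2(\chi,0,0)| + \|\sigma_2(\chi,0,0)\| \le \alpha(1 + \|\chi\|_\infty)$

for any $\chi \in \mathscr{C}$ and $x, x', y, y' \in \mathbb{R}^n$. Accordingly, (3.5) and (3.6), together with (A3), yield
that there exist $\lambda_1' > \lambda_2' > 0$, independent of $\chi$, such that

(3.7)  $2\langle x, b_2(\chi,x,y)\rangle + \|\sigma_2(\chi,x,y)\|^2 \le -\lambda_1' |x|^2 + \lambda_2' |y|^2 + c(1 + \|\chi\|_\infty^2)$

for any $\chi \in \mathscr{C}$ and $x, y \in \mathbb{R}^n$. For a sufficiently small $\lambda' > 0$ obeying $\lambda_1' - \lambda' - \lambda_2' e^{\lambda' \tau} = 0$
due to $\lambda_1' > \lambda_2' > 0$, applying Ito's formula, we infer from (3.7) that

    $e^{\lambda' t} E|Y^\zeta(t,\eta)|^2 \le |\eta(0)|^2 + \int_0^t e^{\lambda' s} E\{c(1 + \|\zeta\|_\infty^2) + (\lambda' - \lambda_1')|Y^\zeta(s,\eta)|^2 + \lambda_2' |Y^\zeta(s-\tau,\eta)|^2\}\,ds$
        $\lesssim \|\eta\|_\infty^2 + e^{\lambda' t}(1 + \|\zeta\|_\infty^2)$, $t \ge 0$.

Consequently, we arrive at

(3.8)  $E|Y^\zeta(t,\eta)|^2 \lesssim e^{-\lambda' t}\|\eta\|_\infty^2 + 1 + \|\zeta\|_\infty^2$.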
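Also, by the Ito formula, in addition to the Burkholder-Davis-Gundy (B-D-G for abbreviation)
inequality, we derive from (A3) and (3.5)-(3.8) that, for any $t \ge \tau$,

(3.9)  $E\|Y^\zeta_t(\eta)\|_\infty^2 \lesssim 1 + \|\zeta\|_\infty^2 + E|Y^\zeta(t-\tau,\eta)|^2 + \int_{t-2\tau}^t E|Y^\zeta(s,\eta)|^2\,ds$
           $+ 2E\Big(\sup_{t-\tau \le s \le t} \int_{t-\tau}^s \langle Y^\zeta(r,\eta), \sigma_2(\zeta, Y^\zeta(r,\eta), Y^\zeta(r-\tau,\eta))\,dW_2(r)\rangle\Big)$
       $\le \frac{1}{2} E\|Y^\zeta_t(\eta)\|_\infty^2 + c\Big\{1 + \|\zeta\|_\infty^2 + E|Y^\zeta(t-\tau,\eta)|^2 + \int_{t-2\tau}^t E|Y^\zeta(s,\eta)|^2\,ds\Big\}$.

On the other hand, following the argument leading to (3.9), one has

(3.10)  $E\|Y^\zeta_t(\eta)\|_\infty^2 \le \frac{1}{2} E\|Y^\zeta_t(\eta)\|_\infty^2 + c\Big\{1 + \|\zeta\|_\infty^2 + \|\eta\|_\infty^2 + \int_0^t E|Y^\zeta(s,\eta)|^2\,ds\Big\}$, $t \in [0,\tau]$.

Thus, combining (3.8) with (3.9) and (3.10) leads to

(3.11)  $E\|Y^\zeta_t(\eta)\|_\infty^2 \le c(e^{-\lambda' t}\|\eta\|_\infty^2 + 1 + \|\zeta\|_\infty^2)$.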

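In what follows, we assume $t \in [t_2 - t_1, t_2]$, and set $\Gamma^\zeta(t,\eta) := Y^\zeta(t,\eta) - \bar Y^\zeta(t,\eta)$ for the
sake of notational simplicity. Again, for a sufficiently small $\lambda > 0$ such that $\lambda_1 - \lambda - \lambda_2 e^{\lambda \tau} = 0$
owing to $\lambda_1 > \lambda_2$, by the Ito formula, it follows from (A2) that

    $e^{\lambda t} E|\Gamma^\zeta(t,\eta)|^2 \le e^{\lambda(t_2 - t_1)} E|\Gamma^\zeta(t_2 - t_1,\eta)|^2 + \int_{t_2 - t_1}^t e^{\lambda s} E\{(\lambda - \lambda_1)|\Gamma^\zeta(s,\eta)|^2 + \lambda_2 |\Gamma^\zeta(s-\tau,\eta)|^2\}\,ds$
        $\le e^{\lambda(t_2 - t_1)} E|\Gamma^\zeta(t_2 - t_1,\eta)|^2 + \lambda_2 e^{\lambda \tau} \int_{t_2 - t_1 - \tau}^{t_2 - t_1} e^{\lambda s} E|\Gamma^\zeta(s,\eta)|^2\,ds$.

This, together with (3.11), yields that

(3.12)  $E|\Gamma^\zeta(t,\eta)|^2 \lesssim e^{-\lambda(t + t_1 - t_2)}(1 + \|\eta\|_\infty^2 + \|\zeta\|_\infty^2)$.

Imitating the procedure used to derive (3.9), we obtain from (A2), (3.5), and
(3.12) that

(3.13)  $E\|\Gamma^\zeta_{t_2}(\eta)\|_\infty^2 \lesssim e^{-\lambda t_1}(1 + \|\eta\|_\infty^2 + \|\zeta\|_\infty^2)$.

This further implies that

    $W_2(P_{t_1}^{\eta,\zeta}, P_{t_2}^{\eta,\zeta})^2 \le E\{1 \wedge \|Y^\zeta_{t_2}(\eta) - \bar Y^\zeta_{t_2}(\eta)\|_\infty^2\} \lesssim e^{-\lambda t_1}(1 + \|\eta\|_\infty^2 + \|\zeta\|_\infty^2)$,

which goes to zero as $t_1$ (hence $t_2$) tends to $\infty$. Thus claim (i) holds.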

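By carrying out an argument similar to that for (3.13), one finds that

(3.14)  $E\|Y^\zeta_t(\eta) - Y^\zeta_t(\eta')\|_\infty^2 \lesssim e^{-\lambda t}\|\eta - \eta'\|_\infty^2$.

For fixed $\zeta$ and arbitrary $\eta, \eta' \in \mathscr{C}$, observe that

(3.15)  $W_2(\pi^\zeta_\eta, \pi^\zeta_{\eta'}) \le W_2(P_t^{\eta,\zeta}, \pi^\zeta_\eta) + W_2(P_t^{\eta',\zeta}, \pi^\zeta_{\eta'}) + W_2(P_t^{\eta,\zeta}, P_t^{\eta',\zeta})$.

Consequently, claim (ii) follows by taking (3.14) and (3.15) into consideration and letting $t \to \infty$.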
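By virtue of (3.11) and the invariance of $\pi^\zeta$, it then follows that

    $\int_{\mathscr{C}} \|\psi\|_\infty^2\, \pi^\zeta(d\psi) \le c\Big\{1 + \|\zeta\|_\infty^2 + e^{-\lambda' t} \int_{\mathscr{C}} \|\psi\|_\infty^2\, \pi^\zeta(d\psi)\Big\}$.

Thus, choosing $t > 0$ sufficiently large such that $c e^{-\lambda' t} < 1$, one finds that

(3.16)  $\int_{\mathscr{C}} \|\psi\|_\infty^2\, \pi^\zeta(d\psi) \lesssim 1 + \|\zeta\|_\infty^2$.

Next, with the aid of the invariance of $\pi^\zeta$, (3.14), and (3.16), we deduce from (A1) that

    $|E\, b_1(\zeta, Y^\zeta_t(\eta)) - \bar b_1(\zeta)| \lesssim \int_{\mathscr{C}} E\|Y^\zeta_t(\eta) - Y^\zeta_t(\psi)\|_\infty\, \pi^\zeta(d\psi) \lesssim e^{-\lambda t/2} \int_{\mathscr{C}} \|\eta - \psi\|_\infty\, \pi^\zeta(d\psi)$
        $\lesssim e^{-\lambda t/2}(1 + \|\eta\|_\infty + \|\zeta\|_\infty)$.

As a result, (3.2) follows after relabeling $\lambda/2$ as $\lambda$. □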

Remark 3.1. It should be noted that there are alternative approaches to obtaining existence
and uniqueness of invariant measures for functional SDEs. Regarding existence
of invariant measures, we mention Es-Sarhir et al. [8] and Kinnally-Williams na, who use Arzela-Ascoli's
tightness characterization; Bao et al. [2], who use a remote start method; Bao et al. [3], who adopt
Kurtz's tightness criterion; and Reiß et al. [26], who consider the semimartingale
characteristics. As for uniqueness of invariant measures, we refer to Hairer et al. na and
Kinnally-Williams na, who utilize an asymptotic coupling method.

The next corollary, which plays a crucial role in discussing the strong limit theorem for the
averaging principle, states that $\bar b_1$, defined by (3.3), enjoys a Lipschitz property.

Corollary 3.2. Under (A1)-(A3), $\bar b_1 : \mathscr{C} \to \mathbb{R}^n$, defined as in (3.3), is Lipschitz.

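Proof. For arbitrary $\phi, \zeta \in \mathscr{C}$, let

    $\nabla_\phi \bar b_1(\zeta) = \frac{d}{d\varepsilon} \bar b_1(\zeta + \varepsilon\phi)\Big|_{\varepsilon = 0}$

be the directional derivative of $\bar b_1$ at $\zeta$ along the direction $\phi$. By Theorem 3.1, we have

    $\nabla_\phi \bar b_1(\zeta) = \lim_{t \to \infty} \nabla_\phi E\, b_1(\zeta, Y^\zeta_t(\eta))$.

According to (A1), to verify that $\bar b_1 : \mathscr{C} \to \mathbb{R}^n$ is Lipschitz, it remains to verify

(3.17)  $\sup_{t \ge 0} E\|\nabla_\phi Y^\zeta_t(\eta)\|_\infty^2 < \infty$.
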
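Observe that $\nabla_\phi Y^\zeta(t,\eta)$ satisfies the following linear SDE with memory

    $d(\nabla_\phi Y^\zeta(t,\eta)) = \{(\nabla^{(1)}_\phi b_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))$
        $+ (\nabla^{(2)}_{\nabla_\phi Y^\zeta(t,\eta)} b_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))$
        $+ (\nabla^{(3)}_{\nabla_\phi Y^\zeta(t-\tau,\eta)} b_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))\}\,dt$
        $+ \{(\nabla^{(1)}_\phi \sigma_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))$
        $+ (\nabla^{(2)}_{\nabla_\phi Y^\zeta(t,\eta)} \sigma_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))$
        $+ (\nabla^{(3)}_{\nabla_\phi Y^\zeta(t-\tau,\eta)} \sigma_2)(\zeta, Y^\zeta(t,\eta), Y^\zeta(t-\tau,\eta))\}\,dW_2(t)$

with the initial data $\nabla_\phi Y^\zeta_0(\eta) = 0$. In the sequel, let $\chi \in \mathscr{C}$ and $x, x', y, y' \in \mathbb{R}^n$. For any
$\varepsilon > 0$, it is trivial to see from (A3) that

    $2\varepsilon\langle x, b_2(\chi, x' + \varepsilon x, y' + \varepsilon y) - b_2(\chi, x', y')\rangle + \|\sigma_2(\chi, x' + \varepsilon x, y' + \varepsilon y) - \sigma_2(\chi, x', y')\|^2$
        $\le -\lambda_1 \varepsilon^2 |x|^2 + \lambda_2 \varepsilon^2 |y|^2$.

Multiplying both sides by $\varepsilon^{-2}$, followed by sending $\varepsilon \downarrow 0$, gives that

(3.18)  $2\langle x, (\nabla^{(2)}_x b_2)(\chi, x', y') + (\nabla^{(3)}_y b_2)(\chi, x', y')\rangle + \|(\nabla^{(2)}_x \sigma_2)(\chi, x', y') + (\nabla^{(3)}_y \sigma_2)(\chi, x', y')\|^2$
        $\le -\lambda_1 |x|^2 + \lambda_2 |y|^2$.

On the other hand, by virtue of (3.5), for any $\varepsilon > 0$, one has

    $\|\sigma_2(\chi, x' + \varepsilon x, y' + \varepsilon y) - \sigma_2(\chi, x', y')\|^2 \le 2\alpha^2 \varepsilon^2 (|x|^2 + |y|^2)$,

which further yields that

(3.19)  $\|(\nabla^{(2)}_x \sigma_2)(\chi, x', y') + (\nabla^{(3)}_y \sigma_2)(\chi, x', y')\|^2 \le 2\alpha^2 (|x|^2 + |y|^2)$.

Thus, with (3.18) and (3.19) in hand, (3.17) holds by repeating the argument by which (3.11)
is obtained. □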

4 Preliminary Results 

In this paper, we study the strong deviation between the slow component $X^\varepsilon(t)$ and the
averaged component $\bar X(t)$, which satisfies the following functional SDE

(4.1)  $d\bar X(t) = \bar b_1(\bar X_t)\,dt + \sigma_1(\bar X_t)\,dW_1(t)$, $\bar X_0 = \xi \in \mathscr{C}$,

where $\bar b_1 : \mathscr{C} \to \mathbb{R}^n$ is defined as in (3.3). To achieve this goal, we need to construct
some auxiliary two-time-scale stochastic systems with memory and provide a number of
preliminary lemmas.

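Throughout this paper, we fix $T > 0$ and set $\delta := \tau/N \in (0,1)$ for a positive integer $N$
sufficiently large. For any $t \in [0,T]$, consider the following auxiliary two-time-scale systems
of functional SDEs

(4.2)  $d\tilde X^\varepsilon(t) = b_1(X^\varepsilon_{t_\delta}, \tilde Y^\varepsilon_t)\,dt + \sigma_1(X^\varepsilon_{t_\delta})\,dW_1(t)$, $\tilde X^\varepsilon_0 = \xi \in \mathscr{C}$,

and

(4.3)  $d\tilde Y^\varepsilon(t) = \frac{1}{\varepsilon} b_2(X^\varepsilon_{t_\delta}, \tilde Y^\varepsilon(t), \tilde Y^\varepsilon(t-\tau))\,dt + \frac{1}{\sqrt{\varepsilon}} \sigma_2(X^\varepsilon_{t_\delta}, \tilde Y^\varepsilon(t), \tilde Y^\varepsilon(t-\tau))\,dW_2(t)$,
       $\tilde Y^\varepsilon(t_\delta) = Y^\varepsilon(t_\delta)$,

with the initial value $\tilde Y^\varepsilon_0 = Y^\varepsilon_0 = \eta \in \mathscr{C}$, where $t_\delta := \lfloor t/\delta \rfloor \delta$ is the nearest breakpoint preceding
$t$, with $\lfloor t/\delta \rfloor$ being the integer part of $t/\delta$.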
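The breakpoint map $t \mapsto t_\delta$ used in the auxiliary systems is a simple floor operation. A minimal sketch follows; the function name `breakpoint_before` and the sample values are illustrative only.

```python
import math


def breakpoint_before(t: float, delta: float) -> float:
    """Return t_delta = floor(t / delta) * delta, the grid point at or just before t."""
    if delta <= 0.0:
        raise ValueError("delta must be positive")
    return math.floor(t / delta) * delta


tau, N = 1.0, 4
delta = tau / N                       # delta = tau / N as in the construction above
grid_values = [breakpoint_before(t, delta) for t in (0.0, 0.1, 0.25, 0.8)]
```

On each interval $[k\delta, (k+1)\delta)$ the slow segment entering the coefficients is held fixed at its value at $k\delta$, which is what allows the auxiliary fast equation to be compared with the frozen equation of Section 3.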

To proceed, we present several preliminary lemmas. The first lemma concerns the continuity
in the mean $L^p$-norm sense for the displacement of the segment process $(X^\varepsilon_t)_{t \in [0,T]}$.

Lemma 4.1. Under (A1) and (A4),

    $\sup_{t \in [0,T]} E\|X^\varepsilon_t - X^\varepsilon_{t_\delta}\|_\infty^p \lesssim_T \delta^{\frac{p-2}{2}}$, $p > 2$.

Proof. In accordance with [21, Theorem 4.1, P.160], we have

(4.4)  $E\Big(\sup_{0 \le t \le T} \|X^\varepsilon_t\|_\infty^p\Big) \lesssim_T 1 + \|\xi\|_\infty^p$.

Observe that

    $E\|X^\varepsilon_t - X^\varepsilon_{t_\delta}\|_\infty^p \le \sum_{m=0}^{N-1} E\Big(\sup_{-(m+1)\delta \le \theta \le -m\delta} |X^\varepsilon(t+\theta) - X^\varepsilon(t_\delta + \theta)|^p\Big) =: \sum_{m=0}^{N-1} J_p(t,m,\delta)$,

where $N = \tau/\delta$ by the definition of $\delta$. To complete the proof of Lemma 4.1, it is sufficient
to show

(4.5)  $J_p(t,m,\delta) \lesssim_T \delta^{\frac{p}{2}}$.

For any $t \in [0,T]$, take $k \ge 0$ such that $t \in [k\delta, (k+1)\delta)$. Thus, for any $\theta \in [-(m+1)\delta, -m\delta]$,
one has

    $t + \theta \in [(k-m-1)\delta, (k+1-m)\delta]$ and $t_\delta + \theta \in [(k-m-1)\delta, (k-m)\delta]$.

In what follows, we consider three cases.

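Case 1: $m \le k-1$. Invoking Holder's inequality and B-D-G's inequality, we obtain from
(A1) and (4.4) that

(4.6)  $J_p(t,m,\delta) \lesssim \delta^{p-1} \int_{(k-m-1)\delta}^{t-m\delta} E|b_1(X^\varepsilon_s, Y^\varepsilon_s)|^p\,ds + E\Big(\sup_{-(m+1)\delta \le \theta \le -m\delta} \Big|\int_{k\delta+\theta}^{t+\theta} \sigma_1(X^\varepsilon_s)\,dW_1(s)\Big|^p\Big)$
       $\lesssim \delta^{p-1} \int_{(k-m-1)\delta}^{t-m\delta} E|b_1(X^\varepsilon_s, Y^\varepsilon_s)|^p\,ds + E\Big|\int_{(k-m-1)\delta}^{t-(m+1)\delta} \sigma_1(X^\varepsilon_s)\,dW_1(s)\Big|^p$
           $+ E\Big(\sup_{-(m+1)\delta \le \theta \le -m\delta} \Big|\int_{t-(m+1)\delta}^{t+\theta} \sigma_1(X^\varepsilon_s)\,dW_1(s)\Big|^p\Big) + E\Big(\sup_{-(m+1)\delta \le \theta \le -m\delta} \Big|\int_{(k-m-1)\delta}^{k\delta+\theta} \sigma_1(X^\varepsilon_s)\,dW_1(s)\Big|^p\Big)$
       $\lesssim \delta^{p-1} \int_{(k-m-1)\delta}^{t-m\delta} E|b_1(X^\varepsilon_s, Y^\varepsilon_s)|^p\,ds + E\Big(\int_{(k-m-1)\delta}^{t-(m+1)\delta} \|\sigma_1(X^\varepsilon_s)\|^2\,ds\Big)^{p/2}$
           $+ E\Big(\int_{t-(m+1)\delta}^{t-m\delta} \|\sigma_1(X^\varepsilon_s)\|^2\,ds\Big)^{p/2} + E\Big(\int_{(k-m-1)\delta}^{(k-m)\delta} \|\sigma_1(X^\varepsilon_s)\|^2\,ds\Big)^{p/2}$
       $\lesssim_T \delta^{\frac{p}{2}}$.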


Case 2: $m \ge k+1$. In view of (A4), it follows that

    $|X^\varepsilon(t+\theta) - X^\varepsilon(t_\delta+\theta)|^p = |\xi(t+\theta) - \xi(t_\delta+\theta)|^p \lesssim \delta^p$.


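Case 3: $m = k$. Also, by Holder's inequality and B-D-G's inequality, we deduce from (A1)
and (4.4) that

(4.7)  $J_p(t,m,\delta) = E\Big(\sup_{-(k+1)\delta \le \theta \le -k\delta} |X^\varepsilon(t+\theta) - X^\varepsilon(k\delta+\theta)|^p\Big)$
       $\lesssim \delta^p + E\Big(\sup_{-(k+1)\delta \le \theta \le -k\delta} \big(|X^\varepsilon(t+\theta) - X^\varepsilon(0)|^p \mathbf{1}_{\{t+\theta \ge 0\}}\big)\Big)$
       $\lesssim \delta^p + E\Big(\sup_{-(k+1)\delta \le \theta \le -k\delta} \Big|\int_0^{(t+\theta)^+} b_1(X^\varepsilon_s, Y^\varepsilon_s)\,ds\Big|^p\Big) + E\Big(\sup_{-(k+1)\delta \le \theta \le -k\delta} \Big|\int_0^{(t+\theta)^+} \sigma_1(X^\varepsilon_s)\,dW_1(s)\Big|^p\Big)$
       $\lesssim \delta^p + \delta^{p-1} \int_0^{t-k\delta} E|b_1(X^\varepsilon_s, Y^\varepsilon_s)|^p\,ds + \delta^{\frac{p-2}{2}} \int_0^{t-k\delta} E\|\sigma_1(X^\varepsilon_s)\|^p\,ds$
       $\lesssim_T \delta^{\frac{p}{2}}$,

where $a^+ := \max\{a, 0\}$ for $a \in \mathbb{R}$. Consequently, the desired assertion (4.5) follows by
taking the discussions above into account. □
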


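The lemma below provides an error bound, in the strong sense, for the difference between
the slow component $(X^\varepsilon(t))$ and its approximation $(\tilde X^\varepsilon(t))$.

Lemma 4.2. Assume that (A1) and (A2) hold and suppose further $\varepsilon/\delta \in (0,1)$. Then,
there exists $\beta > 0$ such that

    $E\Big(\sup_{0 \le t \le T} |X^\varepsilon(t) - \tilde X^\varepsilon(t)|^p\Big) \lesssim_T \delta^{\frac{p-2}{2}}\big(1 + \varepsilon^{-1} e^{\beta\delta/\varepsilon}\big)$, $p > 2$.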

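Proof. In view of Holder's inequality and B-D-G's inequality, it follows from (A1) and
Lemma 4.1 that

    $E\Big(\sup_{0 \le s \le t} |X^\varepsilon(s) - \tilde X^\varepsilon(s)|^p\Big) \lesssim_T \int_0^t E\{\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p + \|Y^\varepsilon_s - \tilde Y^\varepsilon_s\|_\infty^p\}\,ds$
        $\lesssim_T \delta^{\frac{p-2}{2}} + \int_0^t E\|Y^\varepsilon_s - \tilde Y^\varepsilon_s\|_\infty^p\,ds$, $t \in (0,T]$.

Therefore, to finish the argument of Lemma 4.2, it suffices to show that there exists $\beta > 0$
such that

(4.8)  $\sup_{t \in [0,T]} E\|Y^\varepsilon_t - \tilde Y^\varepsilon_t\|_\infty^p \lesssim_T \delta^{\frac{p-2}{2}} \varepsilon^{-1} e^{\beta\delta/\varepsilon}$.

In what follows, we verify claim (4.8) by an induction argument. For any $t \in [0,\tau)$, due to
$Y^\varepsilon_0 = \tilde Y^\varepsilon_0 = \eta$, it is easy to check that

    $E\|Y^\varepsilon_t - \tilde Y^\varepsilon_t\|_\infty^p \le \sum_{j=0}^{\lfloor t/\delta \rfloor} E\Big(\sup_{j\delta \le s \le ((j+1)\delta) \wedge t} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big) =: I(t,\delta)$.

By means of Ito's formula and B-D-G's inequality, together with $\tilde Y^\varepsilon(t_\delta) = Y^\varepsilon(t_\delta)$, we obtain
from (A2) that

    $E\Big(\sup_{j\delta \le s \le ((j+1)\delta) \wedge t} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big) \le \frac{c}{\varepsilon} \int_{j\delta}^{((j+1)\delta) \wedge t} \{E\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p + E|Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\}\,ds$
        $+ \frac{1}{2} E\Big(\sup_{j\delta \le s \le ((j+1)\delta) \wedge t} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big)$, $t \in [0,\tau]$.
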


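Consequently, we conclude that

(4.9)  $I(t,\delta) \lesssim \frac{1}{\varepsilon} \int_0^t E\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p\,ds + \frac{1}{\varepsilon} \int_0^\delta \sum_{j=0}^{\lfloor t/\delta \rfloor} E\Big(\sup_{j\delta \le r \le (j\delta + s) \wedge t} |Y^\varepsilon(r) - \tilde Y^\varepsilon(r)|^p\Big)\,ds$
       $\lesssim \frac{1}{\varepsilon} \int_0^t E\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p\,ds + \frac{1}{\varepsilon} \int_0^\delta I(t,s)\,ds$.

This, combining Lemma 4.1 with Gronwall's inequality, gives that

(4.10)  $E\|Y^\varepsilon_t - \tilde Y^\varepsilon_t\|_\infty^p \lesssim \varepsilon^{-1} \delta^{\frac{p-2}{2}} e^{c\delta/\varepsilon}$, $t \in [0,\tau)$,
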


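for some $c > 0$. Next, for any $t \in [\tau, 2\tau)$, thanks to (4.10), it is immediate to note that

    $E\|Y^\varepsilon_t - \tilde Y^\varepsilon_t\|_\infty^p \le E\big(\|Y^\varepsilon_\tau - \tilde Y^\varepsilon_\tau\|_\infty^p\big) + E\Big(\sup_{\tau \le s \le t} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big)$
        $\le c\Big\{\varepsilon^{-1} \delta^{\frac{p-2}{2}} e^{c\delta/\varepsilon} + \sum_{j=0}^{\lfloor (t-\tau)/\delta \rfloor} E\Big(\sup_{(N+j)\delta \le s \le ((N+j+1)\delta) \wedge t} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big)\Big\}$
        $=: c\{\varepsilon^{-1} \delta^{\frac{p-2}{2}} e^{c\delta/\varepsilon} + M(t,\tau,\delta)\}$.

Carrying out an argument similar to that for (4.9), we deduce from (4.10) that

    $M(t,\tau,\delta) \lesssim \frac{1}{\varepsilon} \int_\tau^t E\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p\,ds$
        $+ \frac{1}{\varepsilon} \int_0^\delta \sum_{j=0}^{\lfloor (t-\tau)/\delta \rfloor} E\Big(\sup_{(N+j)\delta \le r \le ((N+j)\delta + s) \wedge t} |Y^\varepsilon(r) - \tilde Y^\varepsilon(r)|^p\Big)\,ds$
        $+ \frac{\delta}{\varepsilon} \sum_{j=0}^{\lfloor (t-\tau)/\delta \rfloor} E\Big(\sup_{j\delta \le s \le ((j+1)\delta) \wedge (t-\tau)} |Y^\varepsilon(s) - \tilde Y^\varepsilon(s)|^p\Big)$
        $\lesssim \frac{\delta^{\frac{p-2}{2}}}{\varepsilon} + \frac{\delta}{\varepsilon} \cdot \frac{\delta^{\frac{p-2}{2}}}{\varepsilon} e^{c\delta/\varepsilon} + \frac{1}{\varepsilon} \int_0^\delta M(t,\tau,s)\,ds$.

Thus, the Gronwall inequality reads

    $M(t,\tau,\delta) \lesssim \frac{\delta^{\frac{p-2}{2}}}{\varepsilon} \Big(1 + \frac{\delta}{\varepsilon} e^{c\delta/\varepsilon}\Big) e^{c\delta/\varepsilon} \lesssim \frac{\delta^{\frac{p-2}{2}}}{\varepsilon} e^{\beta\delta/\varepsilon}$,

where we have used $\varepsilon/\delta \in (0,1)$ in the second step. Finally, (4.8) follows by repeating the
previous procedure. □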


The following lemma provides a uniform estimate w.r.t. the parameter $\varepsilon$ for the
segment process associated with the auxiliary fast motion.

Lemma 4.3. Assume that (A1) and (A3) hold. Then, there exists $C_T > 0$, independent of
$\varepsilon$, such that

(4.11)  $\sup_{t \in [0,T]} E\|\tilde Y^\varepsilon_t\|_\infty^2 \le C_T$.


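Proof. From (2.2), it follows that

(4.12)  $Y^\varepsilon(t) = \eta(0) + \int_0^{t/\varepsilon} b_2(X^\varepsilon_{\varepsilon s}, Y^\varepsilon(\varepsilon s), Y^\varepsilon(\varepsilon s - \tau))\,ds$
            $+ \int_0^{t/\varepsilon} \sigma_2(X^\varepsilon_{\varepsilon s}, Y^\varepsilon(\varepsilon s), Y^\varepsilon(\varepsilon s - \tau))\,d\bar W_2(s)$, $t > 0$,

where we used the fact that $\bar W_2(t) := \frac{1}{\sqrt{\varepsilon}} W_2(\varepsilon t)$ is a Brownian motion. For fixed $\varepsilon > 0$ and
$t \ge 0$, let $\bar Y^\varepsilon(t + \theta) = Y^\varepsilon(\varepsilon t + \theta)$, $\theta \in [-\tau, 0]$. So, one has $\bar Y^\varepsilon_t = Y^\varepsilon_{\varepsilon t}$. Observe that (4.12) can
be rewritten as

    $\bar Y^\varepsilon(t/\varepsilon) = \eta(0) + \int_0^{t/\varepsilon} b_2(X^\varepsilon_{\varepsilon s}, \bar Y^\varepsilon(s), \bar Y^\varepsilon(s-\tau))\,ds + \int_0^{t/\varepsilon} \sigma_2(X^\varepsilon_{\varepsilon s}, \bar Y^\varepsilon(s), \bar Y^\varepsilon(s-\tau))\,d\bar W_2(s)$.

Then, following the argument used to obtain (3.11), for any $s > 0$ we can deduce that

    $E\|\bar Y^\varepsilon_s\|_\infty^2 \lesssim 1 + \|\eta\|_\infty^2 e^{-\lambda' s} + E\Big(\sup_{0 \le r \le \varepsilon s} \|X^\varepsilon_r\|_\infty^2\Big)$.

This, together with $\bar Y^\varepsilon_s = Y^\varepsilon_{\varepsilon s}$, gives that

    $E\|Y^\varepsilon_{\varepsilon s}\|_\infty^2 \lesssim 1 + \|\eta\|_\infty^2 e^{-\lambda' s} + E\Big(\sup_{0 \le r \le \varepsilon s} \|X^\varepsilon_r\|_\infty^2\Big)$.

In particular, taking $s = t/\varepsilon$ we arrive at

    $E\|Y^\varepsilon_t\|_\infty^2 \lesssim 1 + \|\eta\|_\infty^2 + E\Big(\sup_{0 \le r \le t} \|X^\varepsilon_r\|_\infty^2\Big)$.

This, together with (4.4), yields that

    $\sup_{t \in [0,T]} E\|Y^\varepsilon_t\|_\infty^2 \le C_T$

for some $C_T > 0$. Observe from (4.8) and Holder's inequality that

    $E\|\tilde Y^\varepsilon_t\|_\infty^2 \le 2E\|Y^\varepsilon_t - \tilde Y^\varepsilon_t\|_\infty^2 + 2E\|Y^\varepsilon_t\|_\infty^2 \lesssim_T 1 + \big(\delta^{\frac{p-2}{2}} \varepsilon^{-1} e^{\beta\delta/\varepsilon}\big)^{2/p}$, $p > 4$.

Next, taking $\delta = \varepsilon(-\ln \varepsilon)^{1/2}$ in the estimate above and letting $y = (-\ln \varepsilon)^{1/2}$, we have

    $E\|\tilde Y^\varepsilon_t\|_\infty^2 \lesssim_T 1 + \big(y^{\frac{p-2}{2}} e^{-\frac{p-4}{2} y^2 + \beta y}\big)^{2/p}$, $p > 4$.

Then, the desired assertion follows since the leading term $y^{\frac{p-2}{2}} e^{-\frac{p-4}{2} y^2 + \beta y} \to 0$ as $y \uparrow \infty$
whenever $p > 4$.

□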

5 A Strong Limit Theorem for the Slow Component 

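With several preliminary lemmas at hand, we are in a position to present our main result.

Theorem 5.1. Under (A1)-(A4), one has

    $\lim_{\varepsilon \to 0} E\Big(\sup_{0 \le t \le T} |X^\varepsilon(t) - \bar X(t)|^p\Big) = 0$, $p > 0$.

Proof. For any $t \in [0,T]$ and $p > 0$, set

    $\Lambda(t) := E\Big(\sup_{0 \le s \le t} |X^\varepsilon(s) - \bar X(s)|^p\Big)$ and $\Gamma(t) := E\Big(\sup_{0 \le s \le t} |\tilde X^\varepsilon(s) - \bar X(s)|^p\Big)$.

By Holder's inequality, it is sufficient to verify that

(5.1)  $\lim_{\varepsilon \to 0} \Lambda(T) = 0$, $p > 4$.

In what follows, let $t \in [0,T]$ be arbitrary and assume $p > 4$. It follows
from Lemma 4.2 that

(5.2)  $\Lambda(t) \lesssim E\Big(\sup_{0 \le s \le t} |X^\varepsilon(s) - \tilde X^\varepsilon(s)|^p\Big) + \Gamma(t) \lesssim \delta^{\frac{p-2}{2}}\Big(1 + \frac{1}{\varepsilon} e^{\beta\delta/\varepsilon}\Big) + \Gamma(t)$.

Next, if we can show that

(5.3)  $\Gamma(t) \lesssim \delta^{\frac{p-2}{2}}\Big(1 + \frac{1}{\varepsilon} e^{\beta\delta/\varepsilon}\Big) + \Big(\frac{\varepsilon}{\delta}\Big)^{\nu} + \int_0^t \Lambda(s)\,ds$
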

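for some $\nu \in (0,1)$, inserting (5.3) back into (5.2) and utilizing Gronwall's inequality, we
deduce that

    $\Lambda(t) \lesssim \delta^{\frac{p-2}{2}}\Big(1 + \frac{1}{\varepsilon} e^{\beta\delta/\varepsilon}\Big) + \Big(\frac{\varepsilon}{\delta}\Big)^{\nu}$.

Thus, the desired assertion (5.1) follows by choosing $\delta = \varepsilon(-\ln \varepsilon)^{1/2}$. Indeed, it is easy to see
that $\varepsilon/\delta \in (0,1)$, which is a prerequisite in Lemma 4.2, for $\varepsilon \in (0,1)$ small enough, and that
$\delta \to 0$ as $\varepsilon \downarrow 0$. Furthermore, let $y = (-\ln \varepsilon)^{1/2}$ (hence $\varepsilon = e^{-y^2}$), which goes to infinity as
$\varepsilon$ tends to zero. Then, we have

    $\Lambda(t) \lesssim e^{-\frac{p-2}{2} y^2} y^{\frac{p-2}{2}}\big(1 + e^{y^2 + \beta y}\big) + y^{-\nu}$,

which goes to zero by taking $p > 4$ and letting $y \uparrow \infty$.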
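The effect of the choice $\delta = \varepsilon(-\ln\varepsilon)^{1/2}$ can be checked numerically. The sketch below evaluates $\delta$, the ratio $\varepsilon/\delta$, and a bound of the form $\delta^{(p-2)/2}(1 + \varepsilon^{-1}e^{\beta\delta/\varepsilon}) + (\varepsilon/\delta)^{\nu}$ with all implicit constants set to one; the values $p = 6$, $\beta = 1$, and $\nu = 0.75$ are assumptions chosen for illustration, since the proof does not specify them.

```python
import math


def scaling(eps: float, p: float = 6.0, beta: float = 1.0, nu: float = 0.75):
    """Return (delta, eps/delta, bound) for delta = eps * sqrt(-ln eps), 0 < eps < 1/e."""
    y = math.sqrt(-math.log(eps))
    delta = eps * y
    ratio = eps / delta                      # equals 1 / y
    half = (p - 2.0) / 2.0
    # Logarithms keep the exponentially large factor e^{y^2 + beta*y} from overflowing.
    log_delta_power = -half * y * y + half * math.log(y)   # log of delta^{(p-2)/2}
    log_cross_term = log_delta_power + y * y + beta * y    # adds log(eps^{-1} e^{beta*delta/eps})
    bound = math.exp(log_delta_power) + math.exp(log_cross_term) + ratio ** nu
    return delta, ratio, bound


results = [scaling(eps) for eps in (1e-2, 1e-4, 1e-8, 1e-16)]
```

In this experiment the bound decreases as $\varepsilon$ decreases, but only slowly, because the last term $y^{-\nu} = (-\ln\varepsilon)^{-\nu/2}$ decays logarithmically in $\varepsilon$.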
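Next, we intend to prove (5.3). Set

    $\Gamma_p(t,\delta,\varepsilon) := E\Big(\sup_{0 \le s \le t} \Big|\int_0^s \{b_1(X^\varepsilon_{r_\delta}, \tilde Y^\varepsilon_r) - \bar b_1(X^\varepsilon_{r_\delta})\}\,dr\Big|^p\Big)$, $t \in [0,T]$.

Applying Holder's inequality, B-D-G's inequality, the Lipschitz property of $\bar b_1$ due to Corollary 3.2,
and Lemma 4.1, we derive that

    $\Gamma(t) \lesssim E\Big(\sup_{0 \le s \le t} \Big|\int_0^s \{b_1(X^\varepsilon_{r_\delta}, \tilde Y^\varepsilon_r) - \bar b_1(\bar X_r)\}\,dr\Big|^p\Big) + \int_0^t E\|\sigma_1(X^\varepsilon_{s_\delta}) - \sigma_1(\bar X_s)\|^p\,ds$
        $\lesssim \Gamma_p(t,\delta,\varepsilon) + \int_0^t E|\bar b_1(X^\varepsilon_{s_\delta}) - \bar b_1(X^\varepsilon_s)|^p\,ds + \int_0^t E|\bar b_1(X^\varepsilon_s) - \bar b_1(\tilde X^\varepsilon_s)|^p\,ds$
            $+ \int_0^t E|\bar b_1(\tilde X^\varepsilon_s) - \bar b_1(\bar X_s)|^p\,ds + \int_0^t E\|\sigma_1(X^\varepsilon_{s_\delta}) - \sigma_1(\bar X_s)\|^p\,ds$
        $\lesssim \Gamma_p(t,\delta,\varepsilon) + \int_0^t E\|X^\varepsilon_s - X^\varepsilon_{s_\delta}\|_\infty^p\,ds + \int_0^t E\|X^\varepsilon_s - \tilde X^\varepsilon_s\|_\infty^p\,ds + \int_0^t \Gamma(s)\,ds + \int_0^t \Lambda(s)\,ds$
        $\lesssim \delta^{\frac{p-2}{2}}\Big(1 + \frac{1}{\varepsilon} e^{\beta\delta/\varepsilon}\Big) + \Gamma_p(t,\delta,\varepsilon) + \int_0^t \Gamma(s)\,ds + \int_0^t \Lambda(s)\,ds$,

which, together with Gronwall's inequality, leads to

(5.4)  $\Gamma(t) \lesssim \delta^{\frac{p-2}{2}}\Big(1 + \frac{1}{\varepsilon} e^{\beta\delta/\varepsilon}\Big) + \Gamma_p(t,\delta,\varepsilon) + \int_0^t \Lambda(s)\,ds$,

where we have utilized the fact that $\Gamma_p(t,\delta,\varepsilon)$ is nondecreasing with respect to $t$. By
comparing (5.3) with (5.4), we need only prove

(5.5)  $\Gamma_p(t,\delta,\varepsilon) \lesssim \Big(\frac{\varepsilon}{\delta}\Big)^{\nu}$

for some $\nu \in (0,1)$.

Let

    $\Upsilon_p(k,\delta,\varepsilon) = E\Big|\int_{k\delta}^{((k+1)\delta) \wedge t} \{b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_s) - \bar b_1(X^\varepsilon_{k\delta})\}\,ds\Big|^p$

for any $p > 0$.
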


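In the sequel, we show that (5.5) holds. By Holder's inequality, we obtain that

(5.6)  $\Gamma_p(t,\delta,\varepsilon) = E\Big(\sup_{0 \le s \le t} \Big|\sum_{k=0}^{\lfloor s/\delta \rfloor} \int_{k\delta}^{((k+1)\delta) \wedge s} \{b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_r) - \bar b_1(X^\varepsilon_{k\delta})\}\,dr\Big|^p\Big)$
       $\le E\Big(\sup_{0 \le s \le t} (\lfloor s/\delta \rfloor + 1)^{p-1} \sum_{k=0}^{\lfloor s/\delta \rfloor} \Big|\int_{k\delta}^{((k+1)\delta) \wedge s} \{b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_r) - \bar b_1(X^\varepsilon_{k\delta})\}\,dr\Big|^p\Big)$
       $\le (\lfloor t/\delta \rfloor + 1)^{p-1} \sum_{k=0}^{\lfloor t/\delta \rfloor} \Upsilon_p(k,\delta,\varepsilon)$
       $\le (\lfloor t/\delta \rfloor + 1)^p \max_{0 \le k \le \lfloor t/\delta \rfloor} \Upsilon_p(k,\delta,\varepsilon)$.

For any $p' \in (1,2)$, by Holder's inequality, (A1), and (4.4), observe that

    $\Upsilon_p(k,\delta,\varepsilon) \le \Upsilon_2(k,\delta,\varepsilon)^{\frac{p'}{2}} \Big(E\Big|\int_{k\delta}^{((k+1)\delta) \wedge t} \{b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_s) - \bar b_1(X^\varepsilon_{k\delta})\}\,ds\Big|^{\frac{2(p-p')}{2-p'}}\Big)^{\frac{2-p'}{2}}$
        $\le \Upsilon_2(k,\delta,\varepsilon)^{\frac{p'}{2}} \Big(\delta^{\frac{2(p-p')}{2-p'} - 1} E\int_{k\delta}^{((k+1)\delta) \wedge t} |b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_s) - \bar b_1(X^\varepsilon_{k\delta})|^{\frac{2(p-p')}{2-p'}}\,ds\Big)^{\frac{2-p'}{2}}$
        $\lesssim \Upsilon_2(k,\delta,\varepsilon)^{\frac{p'}{2}} \delta^{p-p'}$, $p > 4$.

Substituting this into (5.6), we arrive at

    $\Gamma_p(t,\delta,\varepsilon) \lesssim \max_{0 \le k \le \lfloor t/\delta \rfloor} \Upsilon_2(k,\delta,\varepsilon)^{\frac{p'}{2}} \delta^{-p'}$.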

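Thus, to complete the argument, with $\nu = p'/2$, it remains to show that

    $\Upsilon_2(k,\delta,\varepsilon) \lesssim \varepsilon\delta$.

Also, by virtue of Holder's inequality, (A1), and (4.4), we derive that

(5.7)  $\Upsilon_2(k,\delta,\varepsilon) = 2\int_{k\delta}^{((k+1)\delta) \wedge t} \int_s^{((k+1)\delta) \wedge t} E\langle b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_s) - \bar b_1(X^\varepsilon_{k\delta}), b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_r) - \bar b_1(X^\varepsilon_{k\delta})\rangle\,dr\,ds$
       $\lesssim \int_{k\delta}^{(k+1)\delta} \int_s^{(k+1)\delta} \big(E|E((b_1(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon_r) - \bar b_1(X^\varepsilon_{k\delta})) \,|\, \mathscr{F}_{k\delta})|^2\big)^{1/2}\,dr\,ds$.

For any $r \in [k\delta, (k+1)\delta)$, by the definition of $\tilde Y^\varepsilon$ in (4.3), it follows that

(5.8)  $\tilde Y^\varepsilon(r) = \tilde Y^\varepsilon(k\delta) + \frac{1}{\varepsilon} \int_{k\delta}^r b_2(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon(u), \tilde Y^\varepsilon(u-\tau))\,du$
           $+ \frac{1}{\sqrt{\varepsilon}} \int_{k\delta}^r \sigma_2(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon(u), \tilde Y^\varepsilon(u-\tau))\,dW_2(u)$
       $= \tilde Y^\varepsilon(k\delta) + \int_0^{(r-k\delta)/\varepsilon} b_2(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon(k\delta + \varepsilon u), \tilde Y^\varepsilon(k\delta + \varepsilon u - \tau))\,du$
           $+ \int_0^{(r-k\delta)/\varepsilon} \sigma_2(X^\varepsilon_{k\delta}, \tilde Y^\varepsilon(k\delta + \varepsilon u), \tilde Y^\varepsilon(k\delta + \varepsilon u - \tau))\,d\tilde W_2(u)$,

where $\tilde W_2(u) := (W_2(\varepsilon u + k\delta) - W_2(k\delta))/\sqrt{\varepsilon}$, which is also a Wiener process. For fixed $\varepsilon > 0$
and $u \ge 0$, let

    $\bar Y^{\varepsilon,k}(u + \theta) = \tilde Y^\varepsilon(k\delta + \varepsilon u + \theta)$, $\theta \in [-\tau, 0]$.

Then (5.8) can be rewritten as

    $\bar Y^{\varepsilon,k}\big(\tfrac{r - k\delta}{\varepsilon}\big) = \tilde Y^\varepsilon(k\delta) + \int_0^{(r-k\delta)/\varepsilon} b_2(X^\varepsilon_{k\delta}, \bar Y^{\varepsilon,k}(u), \bar Y^{\varepsilon,k}(u-\tau))\,du$
        $+ \int_0^{(r-k\delta)/\varepsilon} \sigma_2(X^\varepsilon_{k\delta}, \bar Y^{\varepsilon,k}(u), \bar Y^{\varepsilon,k}(u-\tau))\,d\tilde W_2(u)$.

Consequently, by the weak uniqueness of solution, we arrive at

(5.9)  $\mathscr{L}(\tilde Y^\varepsilon_r) = \mathscr{L}\big(Y^{X^\varepsilon_{k\delta}}_{(r-k\delta)/\varepsilon}(Y^\varepsilon_{k\delta})\big)$,
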
where $\mathscr{L}(\zeta)$ denotes the law of the random variable $\zeta$. Finally, we obtain from (3.2), (5.7), (5.9),
and Lemma 4.3 that

    $\Upsilon_2(k,\delta,\varepsilon) \lesssim \big(1 + E\|X^\varepsilon_{k\delta}\|_\infty^2 + E\|\tilde Y^\varepsilon_{k\delta}\|_\infty^2\big)^{1/2} \int_{k\delta}^{(k+1)\delta} \int_s^{(k+1)\delta} \exp\Big(-\frac{c(r - k\delta)}{\varepsilon}\Big)\,dr\,ds$
        $\lesssim \varepsilon\delta$.

The proof is therefore complete. □

Remark 5.1. In this paper, we only focus on the case where the diffusion coefficient of the
slow component is independent of the fast motion. For the case that the slow component
fully depends on the fast one, there is an illustrative counterexample p.1011] in which
the weak convergence holds but there is no strong convergence.

Remark 5.2. In the present paper, we explore a strong limit theorem for the averaging
principle for a class of two-time-scale SDEs with memory under certain dissipative conditions.
Nevertheless, our main result can be generalized to some cases where the fast motion does
not satisfy a dissipative condition. Indeed, by a close inspection of the argument of Theorem
5.1, to cope with the non-dissipative case, one of the crucial steps is to establish the
ergodic property of the frozen equation without dissipativity. For some special
cases, this problem has been addressed in Bao et al. [3].

Remark 5.3. As we mentioned in the Introduction, the study of two-time-scale
stochastic systems with memory is still in its infancy, so much work remains to be
done. Here, we list some open directions. For the fast component, in this work we
concentrate on the case of point delay. So far, it seems hard to extend our main result
to the general case, e.g., distributed delay, where the main difficulty is to provide an
error bound, in the strong sense, for the difference between the fast component $(Y^\varepsilon(t))$ and its
approximation $(\tilde Y^\varepsilon(t))$. Moreover, it is also very challenging to reveal the rate of strong
convergence established in Theorem 5.1, since the phase space of the segment processes is
infinite-dimensional. The questions above will be addressed in our forthcoming work.